On Translation Corpora and Translation Support Tools: A Project Report

نویسندگان

  • Lars Ahrenberg
  • Magnus Merkel
چکیده

For more than forty years machine translation has been an active area of research and development. Several variants of MT systems have been proposed e.g., interactive systems, batch-mode systems requiring a large amount of human post-editing and systems that generate text in several languages from a common structured set of data rather than a source text. Although there are some successful systems and several products on the market, it is still the case that most translators and translation companies prefer not to use MT-systems. In fact, it has been estimated “that the current share of anything that could be called MT, pure or mixed, is well below 1% of the total translation market” (Isabelle et al, 1993).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Collecting and Using Comparable Corpora for Statistical Machine Translation

Lack of sufficient parallel data for many languages and domains is currently one of the major obstacles to further advancement of automated translation. The ACCURAT project is addressing this issue by researching methods how to improve machine translation systems by using comparable corpora. In this paper we present tools and techniques developed in the ACCURAT project that allow additional dat...

متن کامل

Parallel Data, Tools and Interfaces in OPUS

This paper presents the current status of OPUS, a growing language resource of parallel corpora and related tools. The focus in OPUS is to provide freely available data sets in various formats together with basic annotation to be useful for applications in computational linguistics, translation studies and cross-linguistic corpus studies. In this paper, we report about new data sets and their f...

متن کامل

استخراج پیکره‌ موازی از اسناد قابل‌مقایسه برای بهبود کیفیت ترجمه در سیستم‌های ترجمه ماشینی

Data used for training statistical machine translation method are usually prepared from three resources: parallel, non-parallel and comparable text corpora. Parallel corpora are an ideal resource for translation but due to lack of these kinds of texts, non-parallel and comparable corpora are used either for parallel text extraction. Most of existing methods for exploiting comparable corpora loo...

متن کامل

Translation Technology Tools and Professional Translators’ Attitudes toward Them

Today technology is an integral part of professional translation; and it is generally assumed that translators’ attitudes toward translation technology tools influence their interaction with technology (Bundgaard, 2017). Therefore, the present two-phase study seeks to shed some light on what translation technology tools are and how professional translators feel toward them. The research method ...

متن کامل

A Machine Translation Toolchain for Polysynthetic Languages

We present a set of free tools for building rule-based machine translation systems for polysynthetic languages. As there are no large corpora for most of the “small” languages, it is often impossible to use statistical methods. There are some free MT tools but very little work has been done on polysynthetic languages. The aim of this project is to provide computational tools for morphological a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996